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Abstract 

We  investigate  the  small  sample  properties  of  three  alternative  GMM  estimators  of  asset 
pricing  models.  The  estimators  that  we  consider  include  ones  in  which  the  weighting  matrix 
is  iterated  to  convergence  and  ones  in  which  the  weighting  matrix  is  changed  with  each 
choice  of  the  parameters.  Particular  attention  is  devoted  to  assessing  the  performance  of 
the  asymptotic  theory  for  making  inferences  based  directly  on  the  deterioration  of  GMM 
criterion  functions. 
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1      Introduction 

The  purpose  of  this  paper  is  to  investigate  the  small  sample  properties  of  generalized  method 
of  moments  ( GM.\f)  estimators  applied  to  asset  pricing  models.  Our  alternative  asymptot- 
ically efficient  estimators  include  ones  in  which  the  weighting  matrix  is  estimated  using  an 
initial  (consistent)  estimator  of  the  parameter  vector,  ones  in  which  the  weighting  matrix  is 
iterated  to  convergence  and  ones  in  which  the  weighting  matrix  is  changed  for  every  hypo- 
thetical parameter  value.  The  last  of  these  three  approaches  has  not  been  used  very  much 
in  the  empirical  asset  pricing  literature,  but  it  has  the  attraction  of  being  insensitive  to  how 
the  moment  conditions  are  scaled.  In  addition,  we  study  the  advantages  to  basing  statistical 
inferences  directly  on  the  criterion  function  rather  than  on  quadratic  approximations  to  it. 
We  address  the  following  issues: 

•  How  does  the  procedure  for  constructing  the  weighting  matrix  affect  the  small  sample 
behavior  of  the  GMM  criterion  function? 

•  What  are  the  small  sample  properties  of  confidence  regions  of  parameter  estimators 
constructed  using  the  GMM  criterion  function,  compared  to  confidence  regions  con- 
structed using  standard  errors? 

•  Is  the  small  sample  over-rejection  often  found  in  studies  of  GMM  estimators  reduced 
when  using  an  estimator  in  which  the  weighting  matrix  is  continuously  altered? 

•  How  are  the  small  sample  biases  of  the  GA/.V/ estimators  effected  by  the  choice  of 
procedure  for  constructing  the  weighting  matrix? 

Since  there  has  been  an  extensive  body  of  empirical  work  investigating  the  consumption- 
based  intertemporal  capital  asset  pricing  model  using  GMM  estimation  methods,  we  use 
such  models  as  a  laboratories  for  our  Monte  Carlo  experiments.  .\s  in  Tauchen  (1986)  and 
Kocherlakota  (  1990b].   all  of  our  experiments  come  from  single  consumer  economies  with 
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power  utility  functions.  Within  the  confines  of  these  economies,  there  is  still  considerable 
flexibility  in  the  experimental  design.  Some  of  our  experimental  economies  are  calibrated 
to  annual  time  series  data  presuming  a  century  of  data.  Other  economies  are  calibrated 
to  monthly  post  war  data.  In  the  experiments  calibrated  to  annual  data  and  several  of 
the  experiments  calibrated  to  monthly  data,  the  moment  conditions  are  nonlinear  in  at 
least  one  of  the  parameters  of  interest.  Moreover,  some  of  the  specifications  introduce  time 
nonseparabilities  in  the  consumer  preferences  which  are  motivated  by  either  local  durability 
or  habit  persistence.  In  the  other  experiments  calibrated  to  monthly  data,  the  moment 
conditions  are,  by  design,  linear  in  the  parameter  of  interest.  Some  of  these  setups  are  special 
cases  of  the  classical  simultaneous  equations  model.  For  other  setups,  the  observed  data  is 
modeled  as  being  time  averaged,  introducing  a  moving-average  structure  in  the  disturbance 
terms. 

The  paper  is  organized  as  follows.  Section  2  describes  the  alternative  estimators  we  study 
and  the  related  econometric  literature.  Section  3  specifies  the  Monte  Carlo  environments  we 
use.  Section  4  gives  an  overview  of  the  calculations  including  a  description  of  how  inferences 
are  made  based  directly  on  the  shape  of  the  criterion  functions.  Section  -5  then  presents  the 
results  of  the  Monte  Carlo  experiments  calibrated  to  monthly  data  that  are  linear  in  the 
parameter  of  interest.  Section  6  presents  the  results  calibrated  to  annual  and  monthly  data 
in  which  the  moment  conditions  are  nonlinear  in  the  parameters.  Finally,  our  concluding 
remarks  are  in  section  7. 

2      Alternative  Estimators  and  Related  Literature 

One  ot  the  goals  of  our  study  is  to  compare  the  finite  sample  properties  of  three  alterna- 
tive GMM  estimators,  each  of  which  uses  a  given  collection  of  moment  conditions  in  an 
asymptotically  efficient  manner.  Write  the  moment  conditions  as: 

Ey(X,.3)]=0  (2.1) 


where  l3  is  the  k  dimensional  parameter  vector  of  interest.  In  (2.1)  the  function  ip  has  n  >  k 
coordinates.  We  assume  that  {4fT.J=i'T^i-^t,  ^)}  converges  in  distribution  to  a  normally 
distributed  random  vector  with  mean  zero  and  covariance  matrix  V  (/i). 

Let  Vjifi)  denote  (an  infeasible)  consistent  estimator  of  this  covariance  matrix.  This 
latter  estimator  is  typically  made  operational  by  substituting  a  consistent  estimator  for  /3, 
denoted  {bj}.  .\n  efficient  GMM estimator  of  the  parameter  vector  l3  is  then  constructed  by 
choosing  the  parameter  vector  b  that  minimizes: 

T  T 

The  first  two  GMM  estimators  that  we  consider  differ  in  the  way  in  which  this  is  accom- 
plished. 
Tu-Q-step  Estimator 

The  first  estimator,  called  the  two-step  estimator,  uses  an  identity  matrix  to  weight  the 
moment  conditions  so  that  bj  is  chosen  to  minimize: 

T  T 

[l.Y.^{Xub)\'[\;Y.^{X,M\-  (2-3) 

Let  6y-  denote  the  estimator  obtained  by  minimizing  (2.2). 
Iterative  Estimator 

The  second  estimator  continues  from  the  two-step  estimator  by  reestimating  the  matrix 
\'{3)  using  V'(6j~')  and  constructing  a  new  estimator  frj-.  This  is  repeated  until  bj  converges 
or  until  j  attains  some  large  value.  Let  6'^  denote  this  estimator. 
Continuous-updating  Estimator 

Instead  of  taking  the  weighting  matrix  as  given  in  each  step  of  the  GMM  estimation. 
we  also  consider  an  estimator  in  which  the  covariance  matrix  is  continuouslv  altered  as  b  is 


changed  in  the  minimization.  Formally  let  bj-  be  the  minimizer  of: 

T  T 

[\;Y:AX,M'[yT{h)V[\;Y.^{X,M-      ■  (2-4) 

Allowing  the  weighting  matrix  to  vary  with  b  clearly  alters  the  shape  of  the  criterion  function 
that  is  minimized.  While  the  first-order  conditions  for  this  minimization  problem  have  an 
extra  term  relative  to  problems  with  a  fixed  weighting  matrices,  this  term  does  not  distort  the 
limiting  distribution  for  the  estimator.  [See  Pakes  and  Pollard  (1989,  pages  1044-1046)  for 
a  more  formal  discussion  and  provision  of  sufficient  conditions  that  justify  this  conclusion.] 
An  advantage  of  this  estimator  relative  to  the  previous  two  is  that  it  is  invariant  to  how  the 
moment  conditions  are  scaled  even  when  parameter  dependent  scale  factors  are  introduced. 
A  simple  example  of  a  continuous-updating  estimator  is  a  minimum  chi-square  estimator  used 
for  restricted  multinomial  models  in  which  the  efficient  distance  matrix  is  constructed  from 
the  probabilities  implied  by  the  underlying  parameters  and  hence  is  parameter  dependent. 
The  three  GMM  estimators  have  antecedents  in  the  classical  simultaneous  equations 
literature.  Consider  estimating  a  single  equation,  say 

yt  =  ^'xt  -f  ut  (2.5) 

where  3  is  the  parameter  of  interest.  Let  Zt  denote  the  vector  of  predetermined  variables 
at  time  t  which  by  definition  are  orthogonal  to  U(.  One  way  to  estimate  3  is  to  use  two- 
stage  least  squares  which  is  our  two-step  estimator  under  the  additional  restrictions  that  the 
disturbance  term  is  conditionally  homoskedastic.  and  serially  uncorrected.  In  this  case  the 
iterative  estimator  converges  after  two-steps  and  hence  is  the  two-step  estimator.  It  is  well 
known  that  the  two-stage  least  squares  estimator  is  not  invariant  to  normalization.  In  fact 
Hillier  ( 1990)  criticized  the  two-stage  least  squares  estimator  by  arguing  that  the  object  that 
is  identified  is  the  direction      1     —  .i 


but  not  its  magnitude.  Hillier  then  showed  that  the 
conventional  two-stage  least  squares  estimator  of  direction  is  distorted  by  its  dependence  on 


normalization. 

As  an  alternative,  Sargan  (1958)  suggested  an  instrumental  variables  type  estimator  that 


uiiuiimzes 


-1    .    ^T 


(2.6) 


by  choice  of  b.  Under  the  additional  restrictions  imposed  on  the  disturbance  term  above, 
this  is  our  continuous-updating  estimator.  Notice  that  if  we  ignore  the  denominator  term 
in  (2.6)  and  minimize,  the  solution  is  the  two-stage  least  squares  estimator.  By  including 
the  denominator  term,  Sargan  showed  that  for  an  appropriate  choice  of  ~(,  the  solution  is 
the  (limited  information)  quasi-maximum  likelihood  estimator  (using  a  Gaussian  likelihood), 
which  as  an  estimator  of  direction  is  invariant  to  normalization.^ 

The  estimation  environments  that  we  study  are  more  complicated  than  the  one  just 
described.  Sometimes  the  moment  conditions  are  not  linear  in  the  parameters,  and  the 
disturbance  terms  are  often  conditionally  heteroskedastic  and/or  serially  correlated.  .'Xs  a 
consequence,  the  two-step  and  iterative  estimators  no  longer  coincide.  However,  the  estima- 
tion methods  remain  limited  information  in  that  the  moment  conditions  used  are  typically 
not  sufficient  to  fully  characterize  the  time  series  evolution  of  the  endogenous  variables. 

The  second  goal  of  our  analysis  is  to  compare  the  reliability  of  confidence  regions  com- 
puted using  quadratic  approximations  to  criterion  functions  to  ones  based  directly  on  the 
deterioration  of  the  original  criterion  functions.  The  former  approach  is  more  commonly 
used  in  the  empirical  asset  pricing  literature  partially  because  it  is  easier  to  implement.  The 
latter  approach  exploits  the  chi-square  feature  of  the  appropriately  scaled  criterion  functions. 
From  the  vantage  point  of  hypotheses  testing,  the  plausibility  of  an  observed  deterioration 
ot  the  criterion  function  caused  by  imposing  parameter  restrictions  can  be  assessed  by  using 
the  appropriate  chi-square  distribution.    Using  this  same  insight,  confidence  regions  can  be 


^S^e  Imbens  (1992)  for  an  alternative  GMM  estimator  that  is  invariant  to  normalization.   Imbens'  esti- 
mator IS  constructed  to  coincide  with  the  maximum  likelihood  estimator  when  the  data  is  multinomial 


computed  by  using  the  appropriate  chi-square  distribution  to  prespecify  some  increment  in 
the  criterion  function  and  inferring  the  set  of  parameter  values  that  imply  no  more  than  that 
increment.  Such  confidence  regions  can  have  unusual  shapes  and,  in"  fact,  may  not  even  be 
connected.  One  of  the  key  questions  of  this  investigation  is  whether  or  not  they  lead  to  more 
reliable  statistical  inferences. 

One  of  our  reasons  for  studying  the  performance  of  criterion-function-based  inference 
comes  from  the  work  of  Magdalinos  (1994).  Within  the  confines  of  the  classical  simultaneous 
equations  paradigm,  Magdalinos  studied  the  performance  of  alternative  tests  of  instrument 
admissibility.  As  a  result  of  his  analysis,  Magdalinos  recommended  altering  the  weighting 
matrix  to  embody  the  restrictions  as  is  done  in  the  continuous-updating  method.  In  addi- 
tion, he  found  that  test^-atistics  are  better  behaved  lusing  the  limited  infopmation  maxykiunT 
likelihood  estimator  than  the  two-stage  least  squares  estimator.  Recall  that  the  former  esti- 
mator coincides  with  our  continuous-updating  estimator  and  the  later  to  our  two-step  and 
iterated  estimators  in  the  classical  simultaneous  equations  estimation  environment  consid- 
ered by  Magdalinos.  Another  reason  is  Nelson  and  Startz's  (1990)  criticism  of  the  use  of 
instrumental  variables  methods  for  studying  consumption-based  asset  pricing  models.  These 
authors  were  concerned  about  the  behavior  of  instrumental  variables  estimators  when  the 
instruments  are  poorly  correlated  with  the  endogenous  variables.  Their  arguments  are  based 
on  analogies  to  results  derived  formally  for  t-statistics  and  over-identifying  restrictions  tests 
in  the  classical  simultaneous  equations  setting.  The  question  of  interest  to  us  is  the  extent 
to  which  criterion-function  based  inference  and  continuous  updating  can  help  overcome  the 
concerns  of  Nelson  and  Startz. 

The  finite  sample  properties  of  the  two-step  and  iterative  GMM  estimators  in  an  asset- 
pricing  setting  have  been  studied  previously  by  Tauchen  (1986).  Kocherlakota  (1990b). 
and  Person  and  Foerster  (1991).  These  investigators  did  not  study  the  properties  of  the 
continuous-updating  estimator,  nor  did  they  study  the  behavior  of  criterion  function-based 
confidence  regions.    Further  Tauchen  (1986)  and  Kocherlakota  (1990b)  considered  only  the 


case  of  time  separable  preferences  for  the  representative  consumer. 

3      Monte  Carlo  Environment 

We  consider  several  Monte  Carlo  environments  to  assess  the  finite  sample  properties  of  the 
estimators  described  in  section  2.  The  data  generating  mechanisms  are  constructed  to  be 
consistent  with  a  representative  agent  consumption-based  asset  pricing  model  {CCAPA'f). 
Estimators  of  the  parameters  of  the  representative  agent's  utility  function  are  considered 
along  with  tests  of  the  overidentifying  conditions  implied  by  the  model.  We  use  the  CCAPM 
as  the  basis  of  our  experiments  because  GMM  has  been  used  extensively  in  studying  this 
model. ■^  Further,  the  CCAPM  forms  the  basis  for  two  studies  of  the  finite  sample  properties 
«f  GMM  cohducfced  by  TaiMen  (1986)  and  Koc^rMota  (1990b).      ^>  ■  ^     '  '^ms.  W 

Preferences  and  Euler  Equations 

In  the  model  the  representative  consumer  is  assumed  to  have  preferences  over  consump- 
tion given  by: 


Uo  =  E 


h         1-7 


,7>0  (3.1) 


where  C;  is  consumption  at  date  t.  The  parameter  9  captures  some  time  nonseparability  in 
preferences.''  If  ^  >  0,  consumption  is  durable  or  substitutable  over  time.  If  ^  =  0.  the 
preferences  of  the  consumer  are  time  additive.  If  ^  <  0,  consumption  is  complementary  over 
time  and  the  preferences  of  the  representative  consumer  exhibit  habit  persistence. 

We  consider  estimators  of  the  parameters  8,  7  and  9  as  well  as  tests  of  the  model  based 
on  implications  of  the  Euler  equations.  Let  must  =  (C(  +  9ct-\)~'' .  which  can  be  interpreted 
as  the  indirect  marginal  utility  for  consumption  "services""  as  measured  by  St  —  Ct  -\-  9ct-\  . 


-See.  for  e.xample.  Dunn  and  Singleton  ( 1986),  Eichenbaum  and  Hansen  ( 1990),  Epstein  and  7in  ( 1991), 
Person  and  Constantinides  (1991),  and  Hansen  and  Singleton  (1982). 

•'For  models  with  time  nonseparability  in  preferences  see,  for  example.  Abel  ( 1990),  Constantinides  ( 1990), 
Detemple  and  Zapatero  (1991).  Dunn  and  Singleton  (1986),  Eichenbaum  and  Hansen  (1990),  Gallant  and 
Tauchen  (1989).  Heaton  (1993.  1994),  Novales  (1990),  Ryder  and  Heal  (197.3)  and  Sundaresan  (1989). 


Similarly,  let  muct  =  (c,  +  9ct-i)  ^  +  96E[{ct+i  +  6ct)  ^  |  J-t]  where  Tt  gives  the  information 
set  at  time  t.  The  Euler  equation  for  a  representative  agent's  portfolio  allocation  decision  is 
given  by: 

muct  =  E{Smuct^iRt+\  \  Tt)  (3-2) 

where  /?(+i  is  a  gross  return  on  an  asset  from  <  to  <  +  1.  Since  aggregate  consumption  is 
growing  over  time  we  divided  (3.2)  by  must  to  induce  stationarity.  The  (normalized)  Euler 
equation  that  we  consider  is  then  given  by: 


i^  =  £U 


(,=±ifl,„  I  r) .  (3.3) 

Removing  conditional  expectations  from  (3.3)  results  in  the  Euler  equation  error: 


A«(^.7.<')  =  =^-«=^fl,.,  (3.4) 

must  must 

where  muc'  =  [ct +8ct-i)~'' +  S6{ct+i +9ct)~'^ .  Notice  that  E((pt+2  \  ^t)  =  0-  Further  notice 
that  when  9  is  not  zero,  (f)t+2  has  an  MA(1)  structure.  By  choosing  instruments,  r(,  in  JF(, 
unconditional  moment  conditions  are  given  by: 

E[^t+2(Sn^9)]  =  E[zt<Pt+2{Sn.9)]^0.  (3.5) 

Finally,  notice  that  (i)t+2  can  be  expressed  in  terms  of  consumption  ratios  and  returns,  which 
we  take  to  be  stationary  processes. 

One  unpleasant  feature  of  these  moment  conditions  is  that  they  can  always  be  made 
to  be  satisfied  in  a  degenerate  fashion.  Suppose  that  -)  =  0  and  S9  =  —1.  then  clearly 
inuc'  =  0  and  the  moment  conditions  are  trivially  satisfied.  Without  imposing  additional 
constraints  on  the  parameter  vectors,  this  degeneracy  in  the  moment  conditions  creates 
problems  for  the  two-step  and   iterative  estimators.     Of  course  when   time-separability  is 
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imposed  (0  =  0),  these  problematic  parameter  values  cannot  be  reached.  When  9  is  permitted 
to  be  different  from  zero,  Eichenbaum  and  Hansen  (1990)  were  led  to  divide  the  moment 
conditions  by  1  +  ^^  in  order  that  the  two-step  and  iterative  estimators  not  be  driven  to  the 
degenerate  values.  We  will  do  likewise.  .An  attractive  property  of  the  continuous-updating 
estimator  is  its  insensitivity  to  parameter  dependent  scale  factors  and  hence  to  the  moment 
transformation  used  by  Eichenbaum  and  Hansen  (1990).  Moreover,  the  criterion  of  the 
continuous-updating  estimator  does  not  necessarily  tend  to  zero  in  the  vicinity  7  =  0  and 
S$  =  —1.  Although  the  estimated  mean  for  {(^(+2}  becomes  small  in  the  neighborhood  of 
these  parameter  values  so  does  the  estimated  asymptotic  covariance  matrix,  and  the  criterion 
function  for  the  continuous-updated  estimator  plays  off  this  tension. 

We  buildiseveral  Monte  C^lol^nviconments  tee  simulate  returns  and  consumpti#t-graw^h  .. 
that  are  consistent  with  (3.3).    These  are  used  to  assess  the  finite  sample  properties  of  the 
estimators  of  section  2  based  upon  the  moment  conditions  (3.5). 

3.1      Log-Normal  Model 

Time  Additive  Model.  So  Time  Averaging 

In  our  first  Monte  Carlo  environment  we  model  consumption  growth  and  returns  directly 
by  assuming  that  they  are  jointly  log-normally  distributed  as  in  Hansen  and  Singleton  ( 1983). 
Let  Y{t)  —  [log(ct/c(_i )  log  R^f  log/?;]'  where  C(  is  aggregate  consumption  at  time  t.R^f  is 
the  gross  return  on  a  stock  index  at  time  t  and  R{  is  the  gross  return  on  a  bond  at  time  t  . 
We  assume  that: 

Yt  =  ^i  +  B{L)e,  (3.6) 

where  e,  is  a  normally  distributed  three  dimensional  random  vector  that  is  independent  over 
time,  has  zero  mean  and  covariance  matrix  /.  and  where  fi  is  the  mean  of  Vj.  Further  B(L)  is 
a  matrix  of  polynomials  in  the  lag  operator.  To  use  this  assumption  about  the  dynamics  of 
consumption  and  returns  along  with  the  Euler  equation  (3.2)  we  assume  that  the  preferences 


of  the  representative  agent  are  time  additive.  In  this  case  the  Euler  equation  error  for  each 
return  can  be  written  as: 

-  7log(ct+i/Q)  +  logi?(+i  -  K  =  rjt+i  (3.7) 

where  E{T}t+i  \  !Fi)  =  0  and  «;  is  a  constant  [see,  for  example,  Hansen  and  Singleton  (1983)]. 
The  relation  (3.7)  implies  a  set  of  restrictions  on  the  law  of  motion  (3.6).  To  impose  these 
restrictions  we  consider  several  finite  order  parameterizations  of  B(L)  and  use  the  methods 
described  by  Hansen  and  Sargent  (1991).  These  parameterizations  are  of  the  form: 

B(L)  =  ^  (3.8) 

a(L) 

where  C(L)  is  a  3  by  3  matrix  of  polynomials  in  the  lag  operator  and  a(L)  is  a  1  by  1 
polynomial  in  the  lag  operator.  We  restrict  the  polynomial  a(L)  to  be  second  order  and 
considered  several  different  orders  for  C(L). 

To  estimate  the  constrained  law  of  motion,  we  used  monthly  data  from  1959,2  to  1992,12. 
Aggregate  consumption  is  seasonally  adjusted  real  aggregate  consumption  of  nondurables 
plus  services  for  the  U.S.  taken  from  CITIBASE.  These  data  were  converted  to  a  per  capita 
measure  by  dividing  by  total  U.S.  population  for  each  month,  obtained  from  CITIBASE.  The 
equity  return  is  the  value  weighted  return  from  CRSP,  and  the  bond  return  is  the  Fama-Bliss 
risk  free  return  from  CRSP.  Each  of  these  return  series  was  converted  into  a  real  return  using 
the  implicit  price  deflator  for  nondurables  and  services  from  CITIBASE. 

For  simplicity  we  removed  the  sample  mean  from  the  vector  Vj  so  that  the  constants  in 
(3.6)  and  (3.7)  did  not  have  to  be  estimated.  As  a  result,  the  only  preference  parameter  to 
be  estimated  is  7.  The  results  of  estimating  the  law  of  motion  (3.6)  using  exact  Maximum 
Likelihood  for  different  orders  of  the  polynomial  C(L)  are  given  in  table  3.1.  The  column 
labeled  "Unrestricted  Log-Likelihood"  reports  the  log-likelihood  in  the  case  of  unrestricted 
estimation  of  the  polynomials  in  the  lag  operator.    The  columns  labeled  "Xo  Time  .\vg." 
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report  the  log-likelihood  and  the  estimated  value  of  7  under  the  restrictions  implied  by  (3.7).' 
Notice  that  there  is  substantial  improvement  in  the  log-iikelihood  in  moving  from  a  first  to 
a  second  order  polynomial  for  C(L)  in  the  unrestricted  case.  This  indicates  that  more  than 
a  first  order  polynomial  is  needed  for  C(L)  .  There  is  little  improvement  in  going  to  a  third 
order  polynomial  in  the  unrestricted  case. 

For  both  the  second  and  third  order  polynomial  cases  there  is  great  deterioration  in 
the  log-likehhood  when  the  model  restrictions  are  imposed.  This  is  consistent  with  the 
results  reported  by  Hansen  and  Singleton  (1983).  Also  there  is  little  improvement  in  the 
log-likelihood  in  moving  from  a  second  order  to  a  third  order  C(L).  For  this  reason  we  used 
the  point  estimates  from  the  restricted  model  with  a  second  order  C(L)  to  conduct  our  Monte 
^arlo  experiments  for  the  loff'Ttorfnal  model  with.-jaao  trime  averaging.  ^«8e   #■' 

In  assessing  the  finite  sample  properties  of  GMM  estimators  in  this  case  we  constructed 
•500  Monte  Carlo  samples  each  with  a  sample  size  of  400.  A  sample  size  of  400  approximates 
the  size  of  available  monthly  consumption  data.  We  constructed  moment  conditions  based 
upon  the  Euler  equation  errors  for  both  the  bond  and  the  stock  returns  simultaneously.  For 
each  Euler  equation  error  we  used  one  period  lagged  (log)  bond  and  (log)  stock  returns  and 
one  period  lagged  (log)  consumption  growth  as  instruments.  We  did  not  include  constants 
as  instruments  since  the  data  is  simulated  under  the  assumption  that  it  has  a  zero  mean. 

Time  Additive  Model  with  Time  Averaging 

As  a  further  data  generating  mechanism,  we  also  consider  an  example  in  which  the 
decision  interval  of  the  representative  agent  is  much  smaller  than  the  interval  of  the  data. 
Suppose  that  there  are  n  decision  periods  within  each  observation  period.  For  example  if 
the  representative  agent's  decision  interval  is  a  week  and  data  is  observed  monthly,  then  n 


^In  searching  for  the  maximized  log-likeHhood,  larger  values  of  the  log-likelihood  were  found  for  values  of 
*  larger  than  -50.  The  results  reported  in  table  3.1  for  the  constrained  models  correspond  to  local  maxima. 
We  used  the  local  maximizers  for  our  simulations  because  they  result  in  plausible  values  for  7. 
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would  be  approximately  4.  The  representative  agent's  utility  function  at  time  t  is  given  by: 


Ut^E 


h=0 


(^^^^) 


1-7 


-    1 


1-7 


J't 


(3.9) 


If  we  maintain  the  assumption  that  consumption  and  returns  are  jointly  lognormally  dis- 
tributed, the  Euler  equation  error  r]t+i  is  again  given  by  (3.7).  Moreover,  this  error  can  be 
decomposed  as: 


Vt+i  =  X!  C+iL 
h=i       " 


(.3.10) 


where 


C(+h  =  EiT]t+i  I  J^(+i)  -  E{T]t+i  I  ^^_^.h^) 


3.ii; 


In  this  environment,  we  presume  that  observed  consumption  does  not  correspond  to 
the  actual  point-in-time  consumption  the  representative  agent,  but  instead  is  an  average  of 
actual  consumption  over  one  unit  of  time.  Specifically,  suppose  that  observed  consumption, 
c^,  is  a  geometric  average  of  actual  consumption: 


n^f-i+^ 


l/n 


yh=i 


and  similarly  for  the  observed  return.  7?^.  Averaging  (3.7)  over  time  implies  that: 


7  log«^^/cn  +  log  /?:+,  =  ^-  +  -  E  E  C,-i  +  .+  - 


r  =  l  h=l 


Notice  that  in  this  case  the  Euler  equation  error: 


(3.12) 


[3.131 


^t+\  =  7  im  c,_i+i+i 


:1   h  =  l 


(3.14) 


is  predictable  at  time  /.  However  E{r]'^_^_^  |  /",_!)  =  0  so  that  instruments  can  be  chosen  from 


the  information  set  at  time  t  —  1.  An  instrumental  variables  estimator  of  this  model  must 
account  for  the  MA(1)  structure  of  the  moment  condition. 

We  estimated  the  log-linear  law  of  motion  (3.8)  for  consumption  and  asset  returns'^  under 
the  restriction  implied  by  (3.13).  Notice  that  this  model  imposes  a  weaker  set  of  restrictions 
than  does  the  model  that  takes  no  account  of  time  averaging.^  We  consider  the  case  of  a 
third  order  polynomial  for  C(L)  and  a  second  order  polynomial  for  a(L).  The  results  of  this 
estimation  are  reported  in  table  3.1  in  the  columns  labeled  "Time  Averaged."  Notice  that 
the  log-likelihood  function  improves  somewhat  compared  to  the  case  where  time  averaging 
is  ignored.  However  the  model  is  still  substantially  at  odds  with  the  data.  The  estimated 
value  of  7  is  slightly  larger  as  well. 

We  used  fthis  model  to  create-^00  Monte  Carl©  draws  each  with  a  sample  size  ^^OtT  "As  . 
in  the  case  of  no  time  averaging,  we  studied  estimators  based  upon  the  Euler  equations  for 
both  returns.   In  this  case  the  instruments  were  (log)  stock  returns,  (log)  bond  returns  and 
(log)  consumption  growth  all  lagged  2  periods. 

3.2      Discrete-State  Models 

In  our  second  set  of  Monte  Carlo  environments  we  follow  Tauchen  (1986)  and  Kocherlakota 
(1990b)  and  consider  a  Markov  chain  model  for  aggregate  consumption  and  dividend  growth. 
.Aggregate  consumption  is  assumed  to  represent  the  endowment  of  the  representative  con- 
sumer and  dividends  represent  the  cash  flow  from  holding  stock.  We  form  one-period  stock 
returns  and  the  returns  to  holding  a  one-period  (real)  discount  bond.  Each  of  these  returns 


"The  actual  consumption  data  is  an  arithmetic  average  of  consumption  expenditures  over  a  period. 
In  fitting  the  model  to  the  data  we  are  assuming  that  geometric  averages  and  arithmetic  averages  are 
appro.ximately  the  same.  This  assumption  was  made  by  Grossman.  .VIelino  and  Shiller  (1987),  Hall  (1988) 
and  Hansen  and  Singleton  (1993).  Notice  also  that  the  returns  in  (3.13)  are  time  averaged.  For  simplicity, 
in  estimating  the  law  of  motion  (3.8)  we  use  the  monthly  CRSP  series  directly.  Hansen  and  Singleton  (1993) 
constructed  time  averaged  returns  from  daily  data  in  their  analysis  of  this  model.  For  the  consideration  of 
the  finite  sample  properties  of  GMM  estimators,  this  is  unlikely  to  be  an  important  issue. 

'^'There  is  a  additional  restriction  on  the  first-order  autocorrelation  of  the  Euler  equation  error  for  the 
stock  return.  In  the  limit  case  of  continuous  decision  making,  the  first-order  autocorrelation  of  the  error 
should  be  0.2-5  as  discussed  by  Grossman.  Melino  and  Shiller  (1987)  and  Hall  (1988).  We  do  not  impose  this 
restriction  in  our  estimation. 
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can  be  represented  as  functions  of  the  state  of  the  Markov  chain.  Construction  of  these  re- 
turns in  the  case  of  time  additive  utility  is  described  by  Kocherlakota  (1990b)  and  Tauchen 
(1986). 

Annual  Model 

To  calibrate  the  first  Markov  chain  we  used  the  method  described  by  Tauchen  and  Hussey 
(1991)  to  approximate  a  first-order  VAR  for  consumption  and  dividend  growth.  The  param- 
eters of  the  VAR  are  taken  from  Kocherlakota  (1990b)  and  are  given  by: 


In  6 
lnA( 


0.004 
0.021 


+ 


0.117    0.414 
0.017    0.161 


In  6-1 
lnA(_i 


+  et 


(3.15) 


1^-    •  i?y    ' 

where  ^t  is  the  gross  growth  rate  of  real  annual  dividends  on  the  S&:P500,  where  A(  is  the 
gross  growth  rate  of  U.S.  per  capita  real  annual  consumption,  and  where 


E(et^[)  = 


0.01400  0.00177 
0.00177  0.00120 


[3.16) 


Further  e,  is  assumed  to  be  normally  distributed  and  uncorrelated  over  time.  The  Markov 
chain  for  [if(  A(]'  is  chosen  to  have  16  states. 

In  simulating  data  from  this  model  we  chose  several  values  of  the  preference  parameters 
of  the  representative  consumer.  These  are  presented  in  table  3.2.  Preference  setting  TSl 
was  used  by  Tauchen  (1986)  and  preference  setting  TS2  was  used  by  Kocherlakota  (1990b). 
.As  shown  by  Kocherlakota  (1990b),  these  latter  parameters,  along  with  the  Markov  chain 
model  of  endowments,  imply  first  and  second  moments  for  asset  returns  that  mimic  their 
sample  counterparts.  The  large  value  of  6  in  TS2  is  not  inconsistent  with  the  existence  of 
an  equilibrium  in  the  model  because  of  the  large  value  of  7  [Kocherlakota  (1990a)]. 

AV'e  restrict  our  attention  to  "moderate"  values  of  6  and  7  in  our  examination  of  time 
nonseparable  preferences,  and  we  consider  a  range  of  values  of  9.  Parameter  setting  TNSl 
introduces  a  modest  degree  of  durability  by  letting  9  =   1/3  and  TNS2  introduces  habit 
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persistence  with  d  =  —1/3.  TNS3  results  in  a  more  extreme  amount  of  habit  persistence  by 
setting  9  =  —2/3.  The  asymmetry  (in  magnitude)  across  the  specifications  of  9  is  guided 
in  part  by  the  a  priori  notion  that  there  should  only  be  limited  amount  of  durability  in 
the  goods  classified  as  "nondurable"  in  NIPA  and  by  Person's  and  Constantinides's  (1991) 
empirical  evidence  for  a  substantial  degree  of  habit  persistence. 

Since  there  is  great  flexibility  in  the  construction  of  moment  conditions,  we  chose  several 
sets  of  moment  conditions  that  differ  in  the  number  of  returns  and  instruments  that  are 
used.  The  alternative  sets  are  listed  in  table  3.3.  The  Euler  equation  for  each  listed  return 
wa^  multiplied  by  the  listed  instruments  to  construct  the  moment  conditions,  i?^  denotes  the 
return  to  holding  the  stock  and  Rf  to  holding  the  bond.  We  used  the  dividend-price  ratio, 
4tl Pf  ^  as  an'  instrument  in  •bmeht  set  M4  because  of  its  ability  to  predict  equity  E&t^ns^ 
GMM  estimates  and  test  statistics  were  computed  for  500  replications  of  a  sample  of  size 
100.  This  sample  size  corresponds  approximately  to  the  length  of  most  annual  data  sets. 

Monthly  Model 

We  repeated  some  of  the  experiments  using  a  law  of  motion  calibrated  to  postwar  monthly 
data.  As  in  the  construction  of  the  Markov  chain  for  the  annual  model,  we  started  with 
a  first-order  VAR  for  consumption  and  dividend  growth.  The  consumption  data  used  in 
estimating  the  \'AR  was  aggregate  U.S.  expenditures  on  nondurables  and  services  described 
in  subsection  3.1.  We  constructed  dividends  implied  by  the  monthly  CRSP  value- weighted 
N\  SE  portfolio  return.  These  dividends  were  converted  to  real  dividends  using  the  implicit 
price  deflator  for  monthly  nondurables  and  services  taken  from  CITIBASE.  This  dividend 
series  is  highly  seasonal  because  of  the  regular  dividend  payout  policies  of  most  companies. 
To  avoid  modeling  this  seasonality  we  let  ^,  =  \og{dt/dt-i2)/l'2  and  we  assumed  that  ^t 
represents  the  one-period  dividend  growth  of  the  model.  The  series  {\og(dt/dt-i2)}  appears 
to  be  stationarv. 
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The  parameters  of  the  VAR  estimated  using  this  data  are  given  by: 


In  6 
InXt 


0.0012 
0.0019 


+ 


-0.1768 
0.0267 


0.1941 
-0.2150 


In  6-1 
lnAt_i 


+  et 


(3.17) 


where 


E(etc[)  = 


0.1438 
0.0001 


0.0001 
0.0145 


X  10 


-3 


(3.18) 


As  in  the  case  of  the  annual  Markov  chain  model,  we  approximated  the  VAR  of  (3.17)  and 
(3.18)  with  a  16  state  Markov  chain  using  the  methods  of  Tauchen  and  Hussey  (1991).  The 
Monte  Carlo  data  consisted  of  500  replications  of  a  sample  size  of  400.  We  focused  exclusively 
on  the  more  "moderate" ^reSl^enc£ configuration  (TSl)  adjusting  6  for  the  she  .eT-^Sipiing 
interval.  (More  precisely,  we  used  the  twelfth  root  of  .97  in  place  of  .97  for  6.)  In  addition, 
we  generated  Monte  Carlo  data  using  nonseparable  specification  TNSl  and  TNS2.  again 
with  S  adjusted  appropriately. 

4      Overview  of  Descriptive  Statistics  for  Monte  Carlo 
Results 

In  describing  the  results  of  the  various  Monte  Carlo  experiments  in  sections  5  and  6,  we 
focus  most  of  our  discussion  on  the  following  calculations: 

1.  We  found  the  minimum  value  of  the  criterion  function.  Call  this  Jj-  The  limiting 
distribution  of  TJj  is  chi-square  with  degrees  of  freedom  equal  to  the  number  of  mo- 
ment conditions  minus  the  number  of  parameters  estimated.  We  used  this  limiting 
distribution  to  test  the  overidentifying  moment  conditions. 

2.  We  evaluated  the  criterion  function  at  the  true  parameter  vector.  Call  this  Jj.  The 
limiting  distribution  of  TJj  is  chi-square  with  degrees  of  freedom  equal  to  the  number 
of  moment  conditions.    With  this  limiting  distribution,  we  characterize  the  family  of 
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parameter  vectors  that  look  plausible  from  the  standpoint  of  the  moment  conditions. 
It  is  of  interest  to  see  how  well  th(^  limiting  distribution  performs  in  making  this 
assessment. 

3.  We  found  the  minimum  value  of  the  criterion  function  when  7  is  constrained  to  be  its 
true  value.  Call  this  Jj.  Since  7  is  the  only  parameter  estimated  in  the  log-normal 
model,  in  this  case  Jj  coincides  with  Jj.  The  limiting  distribution  of  T[Jj  —  Jt)  is  chi- 
square  with  one  degree  of  freedom.  This  limiting  distribution  allows  us  to  construct 
a  confidence  region  for  7  based  on  the  increments  of  the  criterion  function  from  its 
unconstrained  minimum.  By  evaluating  TiJy  —  Jt)  we  determined  whether  the  true 
value  of  7  is  in  the  resulting  interval  for  alternative  confidence  levels. 


4.  We  constructed  the  more  standard  confidence  intervals  for  7  based  on  a  quadratic 
approximation  to  the  criterion  function.  Let  77  be  an  estimator  of  7,  and  a^  be 
the  estimated  asymptotic  standard  error"  of  the  estimator  77.  Formally,  we  study 
T{lT  —  iYI{(^t)'  which  has  an  asymptotic  chi-square  distribution  with  one  degree  of 
freedom.  Notice  that  this  object  is  just  the  Wald  statistic  for  the  hypothesis  that  the 
true  value  of  the  parameter  is  7. 

Our  Monte  Carlo  calculations  are  greatly  simplified  by  our  knowledge  of  the  true  param- 
eter vector.  In  empirical  work,  the  corresponding  computations  would  be  more  complicated. 
For  instance,  to  construct  a  confidence  interval  for  7  based  on  the  original  criterion  function, 
a  researcher  would  have  to  characterize  numerically  the  hypothetical  values  of  this  parameter 
that  are  consistent  with  a  prespecified  deterioration  in  the  criterion  while  concentrating  out 
all  of  the  other  parameters.  When  there  are  very  few  remaining  components  in  the  parame- 
ter vector  (in  our  examples  zero,  one  or  two),  this  concentration  is  tractable.  However,  this 


'The  standard  errors  for  the  continuous-updating  estimator  were  based  only  upon  first  derivatives  of 
the  sample  moment  conditions  with  respect  to  the  parameters.  Standard  errors  constructed  in  this  way 
are  analogous  to  the  standard  errors  typically  constructed  in  GMM  estimation  [see  Hansen  (198"2)).  Xn 
alternative  way  to  proceed  would  be  to  base  the  standard  errors  on  derivatives  of  the  criterion  function  with 
respect  to  the  parameters. 
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approach  may  become  very  difficult  when  the  parameter  vector  is  large. 

In  reporting  our  Monte  Carlo  results  we  use  one  of  the  graphical  methods  advocated  by 
Davidson  and  McKinnon  (1994).  For  each  Monte  Carlo  setup  we  computed  the  empirical 
distributions  of  the  statistics  and  compare  them  to  the  corresponding  chi-square  distribu- 
tions. The  results  are  plotted  on  a  set  of  figures  constructed  as  follows.  For  each  probability 
value  (depicted  on  the  x-axis),  we  computed  the  corresponding  chi-square  critical  value  and 
the  fraction  of  the  actual  computed  statistics  that  are  above  that  value  (depicted  on  the 
y-axis).  Thus  the  45  degree  line  (depicted  as  ...  )  is  the  appropriate  reference  for  assessing 
the  quality  of  the  limiting  distribution.  Following  Davidson  and  McKinnon  (1994),  these 
plots  are  referred  to  as  "p-value"  plots,  and  we  present  the  results  for  the  interval  [0  0.5] 
since  this  bounds  the  re^&n  of  probability  values  used  in  most  applications.  Although  wJsHise  * 
probability  values  as  our  basis  of  comparison,  confidence  intervals  at  alternative  significance 
levels  can  be  assessed  by  simply  subtracting  the  probability  values  from  one. 

The  figures  are  organized  as  follows.  For  each  Monte  Carlo  setup  we  first  consider  a  single 
figure  with  four  graphs  titled  as  follows:  "Minimized",  "'True",  "Constrained  -  Minimized" 
and  "Wald"  corresponding  to  the  statistics  TJt.  TJj,  T(Jj  —  Jj)  ,  and  Tifj  —  7)^/(<Tj)^, 
respectively.  To  provide  a  formal  statistical  measure  of  the  distance  between  the  empirical 
distributions  and  their  theoretical  counterparts,  on  each  figure  a  band  about  the  45  degree 
line  is  plotted  using  dotted  lines.  This  band  is  a  90  percent  confidence  region  based  on 
the  Kolmogorov-Smirnov  Test.  This  states  that  the  probability  that  the  maximal  difference 
between  the  empirical  distribution  and  the  theoretical  one  will  lie  within  those  lines  is  90 
percent.  Maximal  differences  within  these  bands  are  not  statistically  significant  at  the  10% 
significance  level.** 

In  each  graph,  the  dashed  line  gives  the  Monte  Carlo  results  for  the  two-step  estimator, 
the  dot-dash  line  for  the  iterated  estimator  and  the  solid  line  for  the  continuous-updating 
estimator.  For  the  minimized  criterion  function  results  there  is  a  necessary  ordering  between 


■^The  Kolmogorov-Smirnov  confidence  region  is  based  on  calculating  the  supremum  between  the  empirical 
distribution  and  the  4-5  degree  line  over  the  region  [0,1]. 
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the  continuous-updating  and  iterative  estimator.  When  the  iterative  estimator  converges, 
the  value  of  the  criterion  function  ran  also  be  obtained  by  the  continuous-updating  estima- 
tor. Since  the  continuous-updating  estimator  minimizes  its  criterion,  this  minimized  value 
must  he  smaller  than  the  criterion  for  the  iterative  estimator.  As  a  result,  the  plot  for  the 
minimized  criterion  of  the  continuous-updating  estimator  must  lie  below  the  plot  for  the 
iterative  estimator  unless  the  iterative  estimator  fails  to  converge.  There  is  no  natural  or- 
dering between  the  results  for  the  two-step  estimator  and  the  continuous-updating  estimator 
or  between  the  two-step  estimator  and  the  iterative  estimator. 

To  complement  our  p-value  plots,  we  also  provide  some  results  summarizing  the  per- 
formance of  the  implied  parameter  estimators.    The  finite  sample  properties  of  the  point 
esstimates  are  of  interest  in  tifeir  own  right  and  in  3ome  cases  provide  additional  insight^Kit^ 
the  behavior  of  the  p-value  plots. 


5      Monte  Carlo  Results,  Log-Normal  Model 

For  the  case  of  time-averaged  data,  {i^(}  has  an  MA(1)  structure  as  we  discussed  in  sec- 
tion 3.1.  To  account  for  this,  the  estimator  of  Vrib)  was  computed  by  first  considering  an 
estimator  of  the  form: 

^T(b)  =  ^T.rT{x,.b)^Tix,.by  +  Y,yT(Xt-,,b)^T{Xt.by  +  ^T{Xt.b)^T{Xt-,.by]\ 

^      Kt-l  t=2  ) 

(o.l) 
where  ^j(  A'(.  6)  =  v(  A',,  b)  —  j  YJ=\  r(-^'(-  b).  When  this  estimator  was  not  positive  definite, 
we  used  an  estimator  proposed  by  Durbin  (1960)  [see  also  Eichenbaum.  Hansen  and  Singleton 
(1988)].^  Durbin's  estimator  is  obtained  by  first  approximating  the  MA(1)  model  with  a  finite 
order  autoregression.    The  residuals  from  this  autoregression  are  used  to  approximate  the 


■'In  the  Monte  Carlo  experiments  with  tune  averaging,  the  use  of  this  covariance  matri.x  estimator  was 
not  necessary  at  any  of  the  converged  parameter  estimates. 
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innovations.  Then  the  parameters  of  the  MA(  1)  model  are  estimated  by  running  a  regression 
of  the  original  time  series  onto  a  one-period  lag  of  the  "approximate"  innovations.  Finally, 
an  estimate  of  Vj(6)  is  formed  using  the  estimated  moving- average  coefficients  and  sample 
covariance  matrix  for  the  residuals.  This  procedure  has  the  advantage  that  the  finite-order 
moving  average  structure  of  {(^J  is  imposed  and  the  estimator  is  positive  semidefinite  by 
construction.  However  it  does  rely  on  the  choice  of  a  finite  order  autoregression  to  use  in 
the  approximation.  In  implementing  the  estimator  we  ran  a  12th  order  autoregression  in  the 
initial  stage. 

Criterion  functions 

Figures  5.1  and  5.2  reportJJae  properties  of  the  criterion  functions  for  the  two  Mact|;  Carlo 
experiments.  Figure  5.1  is  for  the  case  of  no  time  averaging  of  the  data  and  figure  5.2  is  for  the 
case  of  time  averaging.  The  lower  right  plots  in  the  figures  report  the  results  using  the  VVald 
(approximate  quadratic)  criteria.  The  results  for  the  Wald  and  "Constrained-Minimized" 
criteria  are  identical  for  the  Iterative  and  Two-Step  estimators.  This  occurs  because  the 
model  is  linear  in  the  parameters  and  the  weighting  matrix  is  fixed  in  constructing  (Jj  — 
Jt)-  For  the  continuous-updating  estimator  the  results  for  the  Wald  and  the  "Constrained- 
Minimized"  criteria  are  different  due  to  the  dependence  of  the  weighting  matrix  on  the 
hypothetical  parameter  values. 

Notice  that  the  small  sample  distributions  of  the  minimized  criterion  functions  for  the 
iterative  and  two-step  estimators  are  greatly  distorted.  The  small  sample  size  of  tests  of  the 
overidentifying  restrictions  based  upon  the  minimized  criterion  values  are  too  large  leading 
to  over  rejections  of  the  model  when  using  these  estimators.  The  minimized  criterion  func- 
tion for  the  continuous-updating  estimator  is  much  better  behaved  and  the  small  sample 
distribution  is  very  close  to  being  \^  for  both  Monte  Carlo  experiments.  Tests  of  the  overi- 
dentifying restrictions  of  the  models  using  the  minimized  value  of  the  criterion  function  of 
the  continuous-updating  estimator  have  the  correct  size  for  the  model  without  time  averag- 
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ing  and  similarly  for  the  model  with  time-averaging  for  probability  values  less  than  about 
0.1.  Even  for  probability  values  greater  than  0.1,  the  distribution  of  the  minimized  criterion 
for  the  continuous-updating  estimator  is  not  greatly  distorted. 

The  finite  sample  coverage  probabilities  of  the  two  ways  of  constructing  confidence  regions 
for  -y  are  depicted  under  the  headings  "Wald"  and  "Constrained- Minimized".  Recall  that 
these  two  methods  coincide  when  they  are  based  on  the  two-step  and  iterative  estimators, 
but  differ  when  the  continuous-updating  estimator  is  used.  The  small  sample  coverage 
probabilities  are  greatly  distorted  for  the  intervals  constructed  with  the  iterative  and  two- 
step  estimators.  In  particular,  they  do  not  contain  the  true  parameter  value  as  often  as  is  to 
be  expected  from  the  limiting  distribution.  In  the  case  of  the  continuous-updating  estimator, 
coverage  rates  again  are  too  smali'  for  confidence  intervals  built  from  the  Wald  cri^fria,*but 
the  distortion  is  substantially  smaller  than  with  the  other  two  estimators.  Finally,  the 
coverage  rates  of  the  confidence  regions  implied  by  the  "True-Minimized"  criteria  for  the 
continuous-updating  estimator  accord  well  with  the  asymptotic  distribution  and  are  clearly 
better  than  the  coverage  rates  for  the  other  three  criteria. 

These  Monte  Carlo  results  for  the  log-normal  model  support  the  following  remedy  for 
the  concerns  raised  by  Nelson  and  Startz  (1990).  From  the  standpoint  of  hypothesis  test- 
ing and  confidence  interval  construction,  use  of  the  continuous-updating  criterion  is  much 
more  reliable  than  the  other  methods  we  study.  The  tests  of  the  overidentifying  restrictions 
based  on  the  continuous-updating  estimator  do  not  reject  too  often  and  in  fact  are  quite  well 
approximated  by  the  limiting  distribution.  While  confidence  intervals  based  on  the  Wald 
criteria  can  be  badly  distorted,  particularly  for  the  two-step  and  iterative  estimators,  confi- 
dence regions  constructed  from  the  continuous-updating  criteria  have  coverage  probabilities 
that  are  close  to  the  ones  implied  by  the  asymptotic  theory. 

Parameter  Estimates 

Table  5.1  reports  summaries  of  measures  of  central  tendency  for  the  three  estimators  of 
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7  along  with  10%  and  90%  quantiles.  The  medians  for  the  two-step  and  iterative  estimators 
are  considerably  lower  than  the  true  value  of  7  while  the  median  bias  for  the  continuously- 
updated  estimator  is  much  smaller.  However,  the  distribution  for  the  continuous-updating 
estimator  is  also  more  disperse  as  evidenced  by  the  larger  increment  between  the  10%  and 
90%  quantiles.  Moreover,  the  Monte  Carlo  sample  means  for  the  continuous-updating  es- 
timator is  much  more  severely  distorted  than  they  are  for  the  other  two  estimators.  The 
enormous  sample  means  for  the  continuous-updating  estimator  occur  because  in  the  case 
of  no  time  averaging  and  of  time  averaging  there  were  23  and  31  samples,  respectively,  in 
which  the  estimates  are,  in  absolute  value,  larger  than  100.  When  these  are  removed  from 
the  Monte  Carlo  samples,  the  sample  means  of  the  continuous-updating  estimator  are  closer 
to  the  true  values  than  1^  the  means  for  the  other  two  estimators.  &'      ^ 

Recall  that  the  analog  to  the  two-step  and  iterative  estimator  in  the  classical  simultaneous 
equations  model  is  two-stage  least  squares  and  that  the  analog  to  the  continuous-updating 
estimator  is  limited  information  (quasi)  maximum  likeHhood.  It  is  known  from  the  literature 
that  there  are  settings  in  which  two-stage  least  squares  has  finite  moments  but  limited 
information  maximum  likelihood  does  not  [e.^.,  see  Sawa  (1969)  and  Mariano  and  Sawa 
(1972)].  In  light  of  these  theoretical  results  and  our  Monte  Carlo  findings,  the  continuous- 
updating  estimator  is  not  an  attractive  alternative  to  the  other  estimators  we  consider  if  our 
bases  of  comparison  are  the  (untruncated)  moment  properties  [or  even  relative  squared  errors 
as  in  Zellner  (1978)].  On  the  other  hand,  Anderson,  Kunitomo  and  Sawa  (1982)  advocated 
use  the  limited  information  estimator  over  the  two-stage  least  squares  estimator  because, 
among  other  things,  the  median  bias  of  the  former  estimator  is  smaller.  We  also  find  less 
distortion  in  the  medians  for  the  continuous-updating  estimator  in  our  experiments.^" 

.As  we  noted  previously,  one  attractive  attribute  of  the  continuous-updating  estimator  is 
its  invariance  to  ad  hoc  (parameter  dependent)  transformations  of  the  moment  conditions. 


'°Even  though  our  model  is  linear  in  variables  and  parameters,  the  continuous-updating  estimator  does 
not  coincide  with  limited  information  ma.ximum  likelihood  in  our  setting.  .A.mong  other  things,  we  use  a 
heteroskedasticity  consistent  estimator  of  Vj{h). 
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For  instance,  when  the  object  of  interest  is  a  "direction"  in  the  sense  of  Hillier  (1990), 
the  continuous-updating  estimator  is  invariant  to  normalization.  Hillier's  defense  of  limited 
information  maximum  likelihood  over  conventional  two-stage  least  squares  is  that  the  former 
is  a  better  estimator  of  direction.''  To  see  whether  such  a  conclusion  might  well  extends  to 
comparisons  between  the  continuously-updating  estimator  and  the  other  two  estimators  we 
consider,  we  report  smoothed  distributions  of  the  estimated  "direction"  in  figures  5.3  and 
5.4.  We  measure  direction  by  the  angle  (as  measured  in  radians)  between  the  horizontal 
axis  and  the  point  (1,7).  Since  there  is  still  a  sign  normalization  that  must  be  imposed  for 
identification,  we  restrict  attention  to  the  interval  [— 7r/2, 7r/2].  In  smoothing  the  histogram, 
we  used  Gaussian  kernel  with  a  bandwidth  of  0. 1 .  The  value  of  the  density  estimate  is  plotted 
a^.  each  of  the  sample  point#using  a  circle.  The  ^ape  of  the  smoothed  distribution  afengS* 
with  the  mass  of  the  plotted  circles  provides  evidence  about  the  small  sample  distribution 
of  the  parameter  estimators.  Notice  that  the  primary  modes  of  the  continuous-updating 
angle  estimator  are  very  close  to  the  true  parameter  values,  while  the  modes  of  the  other 
two  angle  estimators  are  distorted.  Moreover,  the  density  estimates  for  the  modal  angle  are 
larger  for  the  continuous-updating  method.  However,  the  Monte  Carlo  distributions  for  the 
continuous-updating  angle  estimator  also  have  secondary  modes  near  — 7r/2,  corresponding 
to  large  in  magnitude  estimates  of  7  with  the  wrong  sign. 

Continuous-Updating  Criterion  Function 

The  criterion  function  for  the  continuous-updating  estimator  can  sometimes  lead  to  ex- 
treme outliers  for  the  minimizing  value  of  7.  This  occurs  in  the  two  .Monte  Carlo  experiments 
for  some  of  the  trials  as  we  discussed  above.  To  see  why  this  can  occur  suppose  for  simplicity 
that  there  is  a  single  return  under  consideration,  no  time  averaging  and  several  instruments. 


"Hillier  (1990)  considered  an  alternative  GMM  estimator  of  direction  in  which  the  absolute  value  of 
the  parameter  vector  is  normalized  to  one  instead  of  one  of  the  elements  of  that  vector.  He  found  that 
this  alternative  estimator  \\ss  finite  sample  properties  that  compare  more  favorably  to  those  of  the  limited 

information  ma.ximum  likelihood  estimator. 
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The  moment  conditions  are  constructed  using: 

<l>{Xt,g)  =  [\ogRt+,-g\og{ct+,/ct)]zt       '  (5.2) 

where  Zt  is  a  vector  of  instruments  and  E[(i){Xt,f)]  =  0  [see  (3.7)].  Since  the  moment 
conditions  are  linear  in  g,  the  criteria  for  the  iterated  and  two  step  estimators  are  quadratic  in 
g.  In  contrast,  the  criterion  for  the  continuous-updating  estimator  converges  as  g  gets  large. 
To  see  this,  observe  that  for  a  large  value  of  g  the  sample  average  of  4>{Xt,g)  is  approximately 
g  times  the  sample  mean  of  log(Q^.i/c()^(  and  the  sample  covariance  is  approximately  g^ 
times  the  sample  covariance  of  \og(ct+i/ct)zt  .  Therefore  for  large  g,  the  criterion  function 
is  approximately  a  quadratic  form  that  is  (4)  times  the  chi-square  test  statistic  for  the  null 
hypothesis  that 

£[log(Q+i/Q)r,]  =  0.  (5.3) 

As  a  result  it  is  possible  for  the  minimized  criterion  for  the  continuous-updating  estimator 
to  occur  for  a  very  large  value  of  g. 

To  further  illustrate  this  potential  problem,  the  upper  plot  in  figure  5.5  is  of  the  criterion 
function  for  the  continuous-updating  estimator  for  a  Monte  Carlo  draw  in  which  the  value 
of  g  that  minimizes  the  criterion  function  is  673750.4.  The  lower  plot  in  figure  5.5  gives 
the  average  value  of  the  criterion  function  over  the  Monte  Carlo  experiments.  These  results 
are  for  the  case  of  no  time  averaging.  Notice  that  in  the  upper  plot,  the  criterion  function 
approaches  its  lowest  value  as  g  becomes  large  in  absolute  value.  Even  in  the  lower  plot 
the  criterion  function  asymptotes  to  a  local  minimum  for  large  negative  values  of  g.  The 
numerical  search  used  to  implement  the  estimator  could  be  complicated  by  the  flat  sections 
of  the  criterion  function  and  the  search  routine  could  end  up  spuriously  searching  in  the 
direction  of  very  large  values  of  g.  When  the  parameter  vector  is  of  low  dimension,  this  can 
easily  be  assessed  by  gridding  the  parameter  vector  and  evaluating  the  criterion  function  at 
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the  grid  points.  This  is  what  we  did  to  make  sure  that  large  estimates  of  g  were  not  due  to 
numerical  problems.  However  when  the  parameter  vector  is  of  large  dimension,  implementing 
the  continuous-updating  estimator  may  sometimes  be  difficult. 

6      Monte  Carlo  Results,  Markov  Chain  Models 
6.1      Time  Separable  Preferences,  Annual  Data 

First  we  consider  the  results  for  time  separable  preferences  where  {9  =  0)  using  the  Markov 
Chain  model  calibrated  to  annual  data.  For  these  runs  Vr(6)  is  computed  as  a  simple 
covariance  matrix  estimator.  The  resulting  p- value  plots  are  depicted  in  figures  6.1  through 
6.6  for  different  combinations  of  the  preference  settings  and  moment  sets  given  in  tables  3.2 
and  3.3.  Featlires  of  the  Montfe  Carlo  distributions  for  the  point  estimates  of  7  are  Reported 
in  table  6.1. 

Results  for  7  =  1.3  and  6  =  .97 

We  start  by  discussing  the  results  obtained  using  the  more  "'moderate"  values  of  the 
preference  parameters  TSl  which  were  used  by  Tauchen  (1986).  Figure  6.1  includes  a  repli- 
cation of  findings  in  Tauchen  (1986)  using  moment  conditions  Ml.  It  is  an  example  in  which 
it  is  known  from  Tauchen  that  the  over-identifying  restrictions  test  '"under  rejects."  This 
phenomenon  can  be  seen  in  the  upper  left  plot  in  figure  6.1  by  noting  that  the  dot-dash  line 
is  below  the  45  degree  line.  Since  the  minimized  value  of  the  continuous-updating  estimator 
is  always  less  than  or  equal  to  that  of  the  iterative  estimator  (when  the  iterative  estimator 
converges),  we  expect  the  under  rejection  to  be  more  pronounced  when  the  over-identifying 
restrictions  tests  is  based  on  the  continuous-updating  estimator.  Indeed,  the  under  rejection 
is  more  substantial  for  both  the  two-step  and  continuous-updating  estimators.  Interestingly, 
the  underlying  central  limit  approximation  for  the  continuous-updating  estimator  looks  quite 
good  as  depicted  by  the  upper  right  plot  in  figure  6.1.  The  criterion-function  based  confi- 
dence sets  are  evaluated  in  the  lower  left  plot  in  figure  6.1.    The  confidence  sets  based  on 


the  continuous-updating  estimator  do  not  contain  7  as  often  as  predicted  by  the  asymptotic 
theory  as  might  be  anticipated  from  the  downward  distortion  of  the  minimized  criterion 
functions.  In  contrast,  the  limit  theory  works  well  for  the  confidence  intervals  based  on  the 
Wald  statistic  when  the  continuous-updating  estimator  is  used. 

Consider  next  the  set  of  eight  moment  conditions  M2.  Given  the  large  number  of  mo- 
ment conditions  relative  to  sample  size,  it  is  not  surprising  that  the  underlying  central  limit 
approximations  are  much  less  accurate  (see  the  upper  right  plot  in  figure  6.2).  While  the 
asymptotic  approximations  are  better  with  the  continuous-updating  estimator,  at  least  for 
smaller  probability  values,  the  criteria  evaluated  at  the  true  parameters  still  are  too  large 
when  compared  to  the  magnitudes  predicted  by  the  asymptotic  theory.  On  the  other  hand, 
the  minimized  criteria  %^cti<3iis  are  much  better  behaved,  especially  for  the  continut»»as-  ' 
updating  estimator  with  probability  values  less  than  .15  (see  the  upper  left  plot  in  figure 
6.2).  Confidence  intervals  based  on  criteria  function  behavior  performed  poorly  in  this  set- 
ting, although  they  performed  better  for  the  continuous-updating  estimator  than  for  the 
other  two  estimators.  Moreover,  the  criterion-function  based  confidence  intervals  for  the 
continuous-updating  estimator  proved  to  be  more  reliable  that  the  confidence  intervals  based 
on  the  Wald  statistic. 

Figures  6.3  and  6.4  report  plots  for  the  sets  of  four  moment  conditions  M3  and  M4.  The 
reduction  in  moment  conditions  (relative  to  M2)  leads  to  an  improvement  in  the  underly- 
ing central  limit  approximations  depicted  in  the  upper  right  portions  of  the  figures.  This 
is  especially  true  for  the  continuous-updating  estimator  used  in  conjunction  with  moment 
conditions  M4.  The  minimized  criterion  function  values  used  to  test  the  over-identifying 
restrictions  behave  as  predicted  by  the  asymptotic  theory  for  all  three  estimators  when  mo- 
ment conditions  M3  are  used.  In  contrast  there  is  substantial  under  rejection  for  all  three 
estimation  methods  when  moment  conditions  M4  are  used.  Interestingly,  the  dot-dashed  line 
is  below  the  solid  line  in  the  upper  left  plot  in  figure  6.4.  This  means  that  the  minimized 
values  of  the  criteria  for  the  continuous-updating  estimator  are  not  always  below  those  of 
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the  iterative  estimator.  The  reason  for  this  apparent  anomaly  is  that  the  iterations  on  the 
weighting  matrix  often  did  not  converge  for  this  experiment.  The  criterion-function-based 
confidence  intervals  work  quite  well  for  the  continuous-updating  estimator  when  moment 
conditions  M3  are  used  and  both  methods  for  constructing  confidence  intervals  work  well  for 
the  two-step  estimator  when  moment  conditions  M4  are  used. 

In  summary,  with  the  possible  exception  of  the  over-identifying  restriction  test  using 
moment  conditions  Ml,  the  asymptotic  approximations  for  inferences  based  on  the  iterative 
estimator  perform  worse  than  the  corresponding  approximations  for  the  other  two  estimators. 
In  comparing  the  continuous-updating  estimator  to  the  two-step  estimator,  we  found  the 
following.  First  the  asymptotic  distribution  for  the  over-identifying  restrictions  tests  is  more 
reliable  when  based  on  the  Ifcntinuous-updating  ^sstimator.  Second,  for  both  estiniafegrS'l*' 
confidence  intervals  constructed  based  on  the  original  criterion  functions,  are  often  distorted, 
but  with  the  exception  of  Ml  they  are  no  less  reliable  and  often  more  reliable  than  confidence 
intervals  constructed  via  quadratic  approximations. 

Of  course,  assessing  the  reliability  of  the  asymptotic  theory  as  applied  to  the  different 
parameter  estimators  is  a  different  question  than  assessing  the  performance  of  the  parameter 
estimators  themselves.  In  regards  to  this  latter  question,  the  results  in  the  first  portion  of 
table  6.1  show  that  the  continuous-updating  estimator  tends  to  have  either  comparable  or 
considerably  less  bias  in  the  medians  than  the  other  two  estimators.  On  the  other  hand,  the 
dispersion  in  the  estimators  as  measured  by  the  width  between  the  .10  and  .90  quantiles  is 
always  less  and  often  much  less  for  the  iterative  estimator  than  for  the  continuous-updating 
estimator. 

Results  for  -  =  13.7  and  S  =  1.139 

Next  we  consider  results  using  the  preference  specification  considered  by  Kocherlakota 
(1990b)  (TS2  in  table  3.1).  We  only  consider  the  performance  of  6' MM  estimators  obtained 
using  moment  conditions  M3  and  M4.   Our  results  are  displayed  in  figures  6.5  and  6.6  and 


table  6.1.  With  this  change  in  parameter  configuration,  the  results  for  the  M3  moment 
conditions  are  similar  to  those  in  figure  6.3.  However  under  the  M4  moment  conditions, 
there  is  no  longer  evidence  of  under  rejection  of  the  over-identifying  restrictions.  As  before, 
the  weighting  matrices  for  the  iterative  method  often  failed  to  converge  for  the  M4  runs. 
In  contrast  to  our  earlier  findings,  the  limit  theory  no  longer  provides  a  good  guide  for 
the  coverage  probabilities  for  the  criterion-function-based  confidence  sets  when  the  two-step 
estimator  is  used  in  conjunction  with  moment  conditions  M4  (compare  figures  6.4  and  6.6). 
In  regards  to  the  parameter  estimates,  the  continuous-updating  estimator  again  has  less 
median  bias  than  the  other  two  estimators  but  more  dispersion  (as  measured  by  the  distance 
between  quantiles.) 

6.2     Time  SeparaBle  Preferences,  Monthly  Data 

To  e.xplore  the  extent  to  which  the  limiting  distribution  provides  a  better  guide  for  inference 
for  larger  sample  sizes  (with  less  extreme  data  points),  we  redid  some  our  calculations  using 
simulations  calibrated  to  monthly  data  as  described  in  subsection  3.2.    We  focused  exclu- 
sively on  the  more  "moderate"  preference  configuration  adjusting  8  accordingly.  In  this  case 
we  only  looked  at  estimators  constructed  using  moment  conditions  M2  and  M3.  We  are  par- 
ticularly interested  in  moment  set  M2  because  of  its  common  use  in  practice  when  analyzing 
post  war  data.    Our  results  are  reported  in  figures  6.7  and  6.8  and  table  6.2.    Notice  that 
all  of  the  asymptotic  approximations  are  consistently  reliable  for  the  continuous-updating 
estimation  method.  In  sharp  contrast,  large  sample  inferences  for  the  two-step  estimator  are 
of  particularly  poor  quality  with  the  exception  of  the  over- identifying  restrictions  test  using 
M3.  .\lso.  of  note  is  that  the  iterative  estimates  and  the  continuous-updating  estimates  are 
very  close  to  one  another  when  M3  is  used.   This  is  reflected  in  quantiles  reported  in  table 
6.2  as  well  as  in  the  "Minimized"  and  "Constrained  -  Minimized"  graphs.   Presumably,  the 
reason  for  this  is  that  the  weighting  matrix  tends  to  be  a  relatively  "flat"  function  of  the 
parameters. 
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In  regards  to  the  parameter  estimates  of  7,  both  the  continuous-updating  estimator  and 
the  iterative  estimators  have  distributions  that  are  much  more  concentrated  around  the  true 
parameter  value  than  the  distributions  for  the  two-step  estimator  (again  see  table  6.2).'^ 
These  particular  Monte  Carlo  experiments  are  ones  in  which  the  unconditional  moment 
restrictions  provide  much  more  identifying  information  about  the  power  parameter  7  than 
any  of  the  other  experiments  we  report  on.  Not  only  are  estimates  more  accurate  than  the 
estimates  obtained  from  the  Monte  Carlo  experiments  calibrated  to  annual  data,  but  also 
the  estimates  from  the  log-normal  Monte  Carlo  experiments  reported  in  section  5  which  have 
the  same  sample  size.^'^ 

6.3      Time  Nonseparable  Preferences,  Annual  Data 

As  we  discussed  in  subsection  6.1,  the  continuous-updating  estimator  generally  provides  more 
reliable  inference  in  the  case  of  time  separable  preferences  when  data  are  generated  from  the 
annual  Markov  Chain  model.  However  even  for  that  estimator,  it  is  only  when  moment  con- 
ditions M.3  are  used  that  the  distributions  of  the  criteria  TJj  ("Minimized"),  TJj  ("True") 
and  T{Jj  —  Jj)  ("Constrained-Minimized")  accord  well  with  the  corresponding  chi-square 
distributions.  For  these  reasons  we  consider  results  with  time  nonseparable  preferences  using 
only  moment  conditions  M3  and  only  for  the  continuous-updating  estimator.  To  construct 
our  Monte  Carlo  data  sets  we  used  the  three  time  nonseparable  settings  of  the  parameters 
listing  in  table  3.2  as  TNSl,  TNS2  and  TNS3  {9  =  1/3,^  =  -1/3,^  =  -2/3).  We  also  used 
data  generated  with  ^  =  0,  but  still  estimated  the  parameter  9.  Unlike  the  case  of  time 
separable  preferences  when  the  restriction  ^  =  0  is  imposed  in  the  estimation,  the  estimator 


'-The  performance  of  the  two-step  estimator  could  potentially  be  improved  by  using  a  different  weighting 
matri.K  in  the  first-step.  For  example,  the  residuals  from  nonlinear  two-stage  least  squares  applied  to  each 
Euler  equation  could  be  used  to  estimate  the  asymptotic  covariance  matrix  of  the  moment  conditions. 
Another  possibility  is  to  use  the  covariance  matrix  of  the  prices  of  the  "synthetic"  securities  implicit  in  the 
use  of  instrumental  variables.  See  Hansen  and  Jagannathan  (1993)  for  a  discussion  of  this  weighting  matrix. 

'^Presumably,  the  main  reason  for  this  disparity  is  that  the  Markov  chain  models  are  calibrated  to  div- 
idend behavior  rather  than  return  behavior.  As  is  well  known  from  the  empirical  asset  pricing  literature, 
the  dividend  calibrations  imply  returns  that  are  less  volatile  than  historical  time  series  because  of  some 
fundamental  model  misspecification. 
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Vrib)  of  the  asymptotic  covariance  matrix  accommodates  an  MA(1)  structure  in  the  Euler 
equation  errors.  As  in  section  5  we  used  the  Vrib)  estimator  given  in  (5.1)  except  when  it 
was  not  positive  definite  in  which  case  we  shifted  to  Durbin's  (1960)  estimator  with  a  4th 
order  autoregression. 

Figure  6.9  presents  the  p- value  plots  for  the  criterion  functions  for  the  different  settings 
for  0.^'*  In  contrast  to  the  time  separable  case  (the  solid  line  in  figure  6.3),  the  distribution 
of  the  minimized  criteria  imply  small  sample  over-rejection  of  the  moment  conditions  for 
each  of  the  settings  for  $.  Further,  even  when  evaluated  at  the  true  parameters,  the  criterion 
functions  are  not  distributed  as  a  chi-square.  This  occurs  for  all  four  settings  of  6  including 
^  =  0  (time  separable  preferences).  Evidently  the  estimator  of  the  asymptotic  covariance 
matrix  of  the  moment  cc^itidns,  which  assumes  an  MA(1)  structure  for  the  errors,  caiSses 
small  sample  distortion  of  the  GMM  criterion  function.  Notice  that  the  distributions  of  the 
VVald  statistics  are  very  far  from  being  chi-square  .  Consistent  with  the  results  reported 
in  section  5  and  subsections  6.1  and  6.2,  confidence  intervals  for  7  constructed  using  the 
criterion  function  perform  much  better  than  those  based  on  the  Wald  statistic. 

Table  6.3  reports  statistics  summarizing  the  properties  of  the  estimators  of  7  and  0.  The 
estimator  of  7  does  not  in  general  perform  as  well  as  in  the  time  separable  preference  case 
(see  table  6.1  for  the  comparison).  The  median  for  77  is  substantially  below  the  true  value 
of  7  in  the  case  of  ^  =  0  and  9  =  —1/3  and  above  for  9  =  1/3.  Further,  the  dispersion  of 
the  estimators  of  7  is  considerably  larger  than  when  time  separability  is  correctly  imposed, 
at  least  in  part  due  to  having  to  estimate  an  additional  parameter  and  to  accommodate 
\1.\(1)  terms  in  the  estimator  of  the  asymptotic  covariance  matrix  V'(6).  Regarding  the 
estimators  of  9,  there  is  substantial  dispersion  for  each  case  as  evidenced  by  the  10%  and 
909c  quantiles.  Notice  further  that  there  is  some  median  bias  in  the  estimators  of  9  for  the 
cases  of  ^  =  1/3,  0  and  -1/3  (panels  A.  B  and  C  of  table  6.3).  In  summary,  the  annual  data 


^■^The  Monte  Carlo  e.xperiments  used  to  construct  these  results  are  independent  across  the  different  values 
of  6.  The  p- value  plots  of  Figures  5.1,  5.2  and  6.1-6.8  considered  results  for  fixed  preference  parameters  and 
the  three  different  estimators.  The  same  Monte  Carlo  data  was  used  for  the  experiments  for  each  estimator 
in  these  plots. 
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do  not  permit  simultaneous  estimation  of  9  and  7  with  any  reasonable  precision,  at  least  for 
moment  conditions  M3. 

6.4      Time  Nonseparable  Preferences,  Monthly  Data 

Using  the  Monthly  Markov  chain  model  we  also  examined  the  time  nonseparable  model 
for  9  —  1/3,  0  and  -1/3  and  for  moment  conditions  M3.  We  further  considered  moment 
conditions  M2  since  these  conditions  are  often  used  in  practice  and  since  the  continuous- 
updating  estimators  demonstrated  reasonable  small  sample  properties  under  time-separable 
preferences.  We  did  not  consider  the  case  of  ^  =  —2/3  with  the  monthly  model  because  the 
Markov  chain  model  implies  that  the  covariance  matrix  of  the  Euler  equation  errors  is  close 
to  being  singular  at  this  valu^f  9^  Once  again  we  used  the  estimator  of  V{b)  given  by  (5.1). 
When  Durbin's  (1960)  estimator  was  necessary  we  used  a  r2th  order  autoregression. 

Figure  6. 10  reports  the  results  for  criterion  functions  using  moment  conditions  M2.  Under 
M2,  the  minimized  criterion  performs  reasonably  well  for  all  three  settings  of  9.  Tests  of 
the  overidentifying  restrictions  of  the  model  have  the  correct  small  sample  size  in  this  case. 
Notice  further  that  the  criterion  T{Jj  —  Jj)  ("Constrained-Minimized")  is  close  to  being 
chi-square  distributed,  but  that  the  distribution  of  the  Wald  statistic  is  very  far  from  chi- 
square.  Hence,  we  continue  to  find  that  inferences  based  directly  on  criterion  functions  are 
much  more  reliable  than  those  based  on  quadratic  approximations  to  the  criterion  functions. 
Finally,  we  report  summary  statistics  of  the  central  tendency  of  the  estimators  of  7  and  9  in 
table  6.4,  panels  .\.  B  and  C.  Lack  of  prior  knowledge  of  the  parameter  9  again  causes  the 
estimator  of  -,  to  be  much  less  precise  as  measured  by  the  distance  between  the  90%  and 
10%  quantiles.  (For  instance,  compare  the  first  column  of  table  6.2  to  panel  B  of  table  6.4.) 

Figure  6.11  presents  the  p- value  plots  for  moment  conditions  M3.  In  this  case  the  dis- 
tribution of  the  minimized  criterion  functions  and  the  "Constrained-Minimized"  criteria  are 
not  chi-square.  The  modefs  over  identifying  conditions  are  rejected  too  often  for  ail  of  the 
parameter  settings  and  confidence  intervals  for  the  parameter  -/   have  the  wrong  coverage 
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probabilities.  With  moment  conditions  M3,  allowing  for  time  nonseparability  results  in  a 
substantial  number  of  very  large  estimates  of  7  as  reflected  by  the  size  of  the  90%  quantiles 
(see  table  6.4,  panels  C,  D  and  E).  It  appears  that  the  addition  of  returns  as  instruments 
(the  difference  between  moment  conditions  M2  and  M3)  improves  the  performance  of  the 
estimator  and  the  quality  of  the  central  limit  approximations. 

7      Concluding  Remarks 

In  this  paper  we  examined  the  finite  sample  properties  of  three  alternative  GMM  estimators 
that  differ  in  the  way  in  which  the  moment  conditions  are  weighted.  Particular  attention 
was  paid  to  both  the  performance  of  tests  of  over-identifying  restrictions  and  to  comparing 
alternative  ways  of  consti^ctmg  apnfidence  sets.  In  documenting  finite  sample  properties, 
we  used  several  different  specifications  of  the  consumption-based  CAPM.  The  experiments 
differed  substantially  in  the  amount  sample  information  there  is  about  the  parameters  of 
interest.  While  the  experiments  do  not  uniformly  support  the  conclusion  that  one  estimator 
dominates  the  others,  some  interesting  patterns  emerged. 

•  Continuous-updating  in  conjunction  with  criterion-function  based  inference  often  per- 
formed better  than  other  methods  for  annual  data,  however  the  large  sample  approxi- 
mations are  still  not  very  reliable. 

•  In  monthly  data  the  central  limit  approximations  for  the  continuous-updating  esti- 
mation method  applied  in  conjunction  with  the  criterion  function-based  method  of 
inference  performed  well  in  most  of  our  experiments,  including  ones  in  which  the  pa- 
rameters are  estimated  very  accurately  and  ones  in  which  there  is  a  substantial  amount 
of  dispersion  in  the  estimates. 

•  Confidence  intervals  constructed  using  quadratic  approximations  to  the  criterion  func- 
tion performed  very  poorly  in  many  of  our  experiments. 
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•  The  continuous-updating  estimator  typically  had  less  median  bias  than  the  other  es- 
timators, but  the  Monte  Carlo  sample  distributions  for  this  estimator  sometimes  had 
much  fatter  tails. 

•  The  tests  for  over  identifying  restrictions  are,  by  construction,  more  conservative  when 
the  weighting  matrix  is  continuously  updated,  and  in  many  cases  this  led  to  a  more 
reliable  test  statistic.  (However,  we  made  no  attempts  at  comparing  power  even  for 
size  corrected  tests.) 

Our  Monte  Carlo  experiments  for  monthly  data  were  sufficiently  successful  to  convince 
us  to  reexamine  some  of  the  empirical  evidence  for  the  consumption-based  CAPM.  In  most 
tests  of  the  consumption-based  CAPM,  the  model's  over  identifying  conditions  ar€;¥eje?ted 
[see,  for  example,  Hansen  and  Singleton  (1982)].  Since  the  two-step  or  iterative  estimator 
is  typically  used  in  practice,  one  potential  explanation  for  these  rejections  could  be  the 
tendency  of  these  estimators  to  result  in  over  rejection  of  the  model  in  small  samples.  To 
assess  this  possibility  we  estimated  the  time  separable  and  time  nonseparable  models  using 
the  continuous-updating  estimator.  We  used  the  consumption  and  return  data  described  in 
subsection  3.1  along  with  moment  conditions  M2  given  in  table  -3.3. 

Estimation  of  the  time  separable  model  resulted  in  point  estimates  of  6  and  7  of  0.25  and 
720.65  respectively.  This  is  an  example  where  the  tail  behavior  of  the  criterion  results  in  large 
estimated  value  of  7.  The  minimized  GMM  criterion  was  5.94  with  an  implied  p-value  of 
0.43  hence  it  appears  that  the  continuous-updating  estimator  implies  that  the  model  is  not  at 
odds  with  the  data.  However  the  estimate  parameters  are  very  far  from  being  economically 
plausible.  .\s  we  found  in  several  of  our  Monte  Carlo  experiments,  with  the  continuous- 
updating  estimator,  extreme  point  estimates  of  the  parameters  are  possible.  However  in 
those  cases  there  typically  was  little  deterioration  in  the  criterion  function  when  evaluated 
near  the  true  parameter  values  so  in  practice  it  is  important  to  evaluate  the  criterion  tunction 
at  plausible  values  of  the  parameters.    In  this  case  we  restricted  7  to  the  range  [0  20]  and 
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estimated  8  for  each  hypothetical  value  of  7.  The  resulting  minimized  criterion  as  a  function 
of  7  is  plotted  in  the  top  panel  of  figure  7.1.  Notice  that  for  this  range  of  7  the  minimized 
criterion  function  is  well  above  30  where  the  implied  p- value  is  essentially  zero.  As  a  result, 
once  a  plausible  set  of  parameters  is  considered,  the  model  is  still  rejected  when  using  the 
continuous-updating  estimator. 

In  estimation  of  the  time  nonseparable  model  the  point  estimates  of  the  parameters  were 
also  quite  implausible  with  estimates  of  6,  7  and  0  of  1.20,  267.96  and  0.32  respectively. 
The  bottom  panel  of  figure  7.1  presents  the  criterion  function  for  the  continuous-updating 
estimator  with  7  restricted  to  the  range  [0  20].  At  7  =  20  the  criterion  reaches  a  minimum 
of  13.55  with  an  implied  p-value  of  .035.  As  a  result,  even  at  this  extreme  value  for  7  the 
model  is  still  substantialip'at  odds  with  the  data.  ''  i^ 

In  summary,  although  the  continuous-updating  estimator  does  not  save  the  consumption- 
based  CAPM,  the  experiments  that  we  have  presented  provide  evidence  that  it  should  be  of 
use  in  manv  GMM  estimation  environments. 
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Table  3.1:  MLE  of  Monthly  Law  of  Motion 


Order  of  C(L) 

Unrestricted 
Log- Likelihood 

No  Time 

Avg. 

Time  Averaged 

Log-Likelihood 

7 

Log-Likelihood               7 

1 
2 
3 

3220.6 
3237.3 
3238.8 

3214.9 

3218.1 

4.55 
4.21 

3230.3                  4.99 

Table  3.2:  Preference  Settings  for  Discrete  States  Space  Model 


^ 


U  =  E 


t=0 


1-7 


&■' 


Preference  Case 


7 


e 


TSl 

TS2 

TNSl 

TNS2 

TNS3 


0.97 

1.139 

0.97 

0.97 

0.97 


1.3 
13.7 
1.3 
1.3 
1.3 


0 
0 

1/3 
-1/3 
-2/3 


Moment  Set 


Table  3.3:  Moment  Conditions 


Returns 


Instruments,  C( 


#  of  Moment  Conditions 


Ml 
M2 
M3 
M4^ 


RUi-RL  RlR{,ct/ct-,d 

RUi^R{+i  A„i 

RUy.RL  dt/P,\R{.l 


3 

8 
4 
4 


''Lagged  bond  return   used  as  an  instrument  only  for  the  bond  return.     Dividend-price  ratio  used  as  an 
instrument  oniv  for  the  stock  return. 
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Table  5.1:  Properties  of  Estimators  of  7,  Log-Normal  Model 


Median 
Mean 

Truncated  Mean" 
10%  Quantile 
90%  Quantile 

Median 
Mean 

Truncated  Mean'' 
10%  Quantile 
90%  Quantile 


Continuous- Updating 


Iterative 


A.  No  Time  Averaging,  True  7  =  4.55 

3.72  1.64 

7171.17  1.81 

4.47  1.81 
-3.67  -0.23 
18.75  4.00 

B.  Time  .Averaging,  True  7  =  4.99 

3.48  1.75 
-518.14  1.88 

3.18  1.88 

-10.48  -0.29 

11.52  4.22 


Two-Step 


1.73 

2.06 

2.06 

-0.20 

4.33 

1.74 

2.04 

2.04 

-0.66 

4.85 

"Estimates  with  absolute  values  greater  than   100  were  excluded  from  the  computation  of  the  truncated 
means. 
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Table  6.1:  Properties  of  Estimators  of  7,  Markov  Chain  Model,  Annual  Data 


Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 

Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 

Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 

Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 

Mean 

Truncated  .Mean" 

Median 

10%  Quantile 

90%  Quantile 

Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 


Continuous- Updating 


Iterative 


Two-Step 


A. 

TSl-Ml, 

True  7 

=  1.3,*  = 

.97 

0.98 

1.58 

1.53 

1.25 

1.58 

1.53 

1.50 

1.54 

1.58 

-5.81 

-1.91 

-1.83 

7.80 

4.61 

4.59 

fl. 

TS1-M2, 

True  7 

=  1.3,*  = 

.97 

1.90 

0.85 

1.08 

1.90 

0.85 

1.08 

1.15 

0.70 

0.78 

tf' 

0.61 

5.76 

0.13 
1.60 

;?ri5* 

4.09 

C. 

TS1-M3, 

True  7 

=  1.3,*  = 

.97 

1.36 

1.35 

1.35 

1.53 

1.35 

1.35 

1.17 

1.10 

1.25 

0.61 

-1.35 

-2.71 

4.10 

3.51 

5.59 

D, 

TS1-M4. 

True  7 

=  1.3,*  = 

.97 

2.84 

0.78 

0.90 

2.13 

0.78 

0.90 

1.03 

0.35 

0.59 

-0.15 

-1.01 

-2.73 

7,12 

4.21 

5.50 

£.  TS2-M3,  True  7  = 

:  13.7,*  = 

1.139 

6.29 

13.68 

14,10 

14.80 

13.68 

14.10 

14.14 

13.10 

13.43 

8.96 

9.01 

9.85 

22.68 

19.40 

19.54 

F.  '. 

TS2-M4.  '. 

True  7  = 

-.   13.7,*  = 

1.139 

16.94 

10.81 

12.51 

14.78 

10.81 

12.50 

14.34 

10.98 

12.06 

4.47 

2.81 

6.78 

23.68 

18.42 

18.97 

"Estimates  with  absolute  values  greater  than   100  were  excluded  from  the  computation  of  the  truncated 
means. 
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Table  6.2:  Properties  of  Estimators  of  7,  Markov  Chain  Model,  Monthly  Data 

True  7  =  1.3,  6  =  0.97i/'2 


Continuous-Updating 

A. 

TSl 

Iterative 

Two-Step 

-M2 

1.37 

1.25 

1.57 

1.37 

1.25 

1.57 

1.28 

1.18 

1.17 

1.02 

0.96 

-2.12 

1.85 

1.61 

4.63 

B. 

TSl 

-M3 

1.38 

1.38 

1.84 

1.38 

1.38 

1.84 

1.27 

1.27 

1.27 

1.00 

1.00 

-2.95 

1.89 

1.89 

6.50 

Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 

Mean 

Truncated  Mean'' 

Median 

10%  Quantile 

90%  Quantile 

"Estimates  with  absolute  values  greater  than   100  were  excluded  from  the  computation  of  the  truncated 
means. 
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Table  6.3:  Properties  of  Estimators  of  7  and  6,  Markov  Chain  Model,  Annual  Data 
Continuous-Updating  Estimator,  True  7  =  1.3,6  =  .97 


Mean 

Truncated  Mean" 

Median 

10%  Quantile  ^- 

90%  Quantile 

Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 

Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 

Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 


A.  TNS1-M3,  True  6^  =  1/3 

4.56 

5.99  xlO^ 

4.56 

0.83 

1.71 

0.52 

0.64 

0.10 

10.83 

1.25 

B.  TS1-M3,  True  9  =  0 

5.88 

-0.34 

3.67 

-0.34 

0.73 

-0.12 

0.00 

-0.92 

10.92 

0.18 

C.  TNS2-M3,  True  ^  =  -1/3 

3.05 

-0.46 

2.69 

-0.46 

0.97 

-0.42 

0.01 

-0.93 

8.28 

-0.01 

D.  TNS3-M3.  True  9  =  -2/3 

0.41 

0.07 

2.16 

-0.64 

1.22 

-0.69 

0.04 

-0.94 

5.79 

-0.40 

& 


"Estimates  with  absolute  values  greater  th-^n   100  were  e.xcluded  from  the  computation  of  the  truncated 
means. 
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Table  6.4:  Properties  of  Estimators  of  7  and  9,  Markov  Chain  Model,  Monthly  Data 
Continuous-Updating  Estimator,  True  7  =  1.3,6  =  .97*'''^ 


Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 

Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 

Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 

Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%'  Quantile 

Mean 

Truncated  Mean" 

.Median 

10%  Quantile 

90%  Quantile 

.Mean 

Truncated  Mean" 

Median 

10%  Quantile 

90%  Quantile 


A.  TNS1-M2,  True  0  = 

1/3 

13.56 

0.35 

2.90 

0.35 

1.27 

0.34 

0.79 

0.22 

7.35 

0.48 

B.  TS1-M2,  True  6  = 

0 

16.68 

-0.01 

4.18 

-0.01 

1.29, 

0.38  ■ 

0.01 
-0.23 

13.46 

0.19 

C.  TNS2-M2.  True  6  = 

-1/3 

6.03 

-0.42 

5.82 

-0.42 

1.24 

-0.35 

0.00 

-0.98 

17.95 

0.09 

D.  TNS1-M3.  True  9  = 

1/3 

153.96 

0.66 

11.09 

0.66 

1.45 

0.37 

0.48 

0.16 

410.29 

1.16 

E.  TS1-M3.  True  9  = 

:0 

62.51 

-0.14 

17.89 

-0.14 

1.40 

0.04 

0.03 

-0.66 

142.79 

0.26 

F.  TNS2-M3.  True  9  = 

-1/3 

12.98 

-0.40 

12.78 

-0.40 

0.35 

-0.58 

0.01 

-0.91 

40.99 

0.17 

■^Estimates  with  absolute  values  greater  than    100  were  excluded  from  the  computation  of  the  truncated 
means. 


43 


Figure  5.1 

Criterion  Functions 

Monthly  Log-Normal  Model,  No  Time  Averaging 
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Figure  5.2 

Criterion  Functions 

Monthly  Log-Normal  Model,  Time  Averaging 

.Iterative, Two-Step, Continuous-Updating 
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Figure  5.3 

Smoothed  Distribution  of  Estimated  Angle  implied  by  (1,  7r] 

Monthly  Log-Normal  Model,  No  Tim^  Averaging 
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Figure  5.4 

Smoothed  Distribution  of  Estimated  Angle  implied  by  (1,  7t] 

Monthly  Log-Normal  Model,  Time  Averaging 


Continuous-Updating 


Iterative 


1      1.5 


Two  Step 


I      1.5 


47 


Figure  5.5 

Criterion  Function  for  Continuous-Updating  Estimator 

Monthly  Log-Normal  Model,  No  Time  Averaging 
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Figure  6.1 
Criterion  Functions 
Annual  Markov  Chain  Model 
.97,  7  =  1.3.  ^  =  0.  Moment  Conditions  Ml. 
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Figure  6.2 

Criterion  Functions 

Annual  Markov  Chain  Model 

0  =  .97,  7  =  1.3,  ^  =  0.  Moment  Conditions  M2. 
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Figure  6.3 
Criterion  Functions 
Annual  Markov  Chain  Model 
.97,  7  =  1.3.  ^  =  0.  Moment  Conditions-  M3. 
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Figure  6.4 
Criterion  Functions 
Annual  Markov  Chain  Model 
.97,  7  =  1.3,  ^  =  0.  Moment  Conditions  M4. 
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Figure  6.5 

Criterion  Functions 

Annual  Markov  Chain  Model 

0  =  1.139,  7  =  13.7,  ^  =  0.  Moment  Conditions  M3. 
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Figure  6.6 

Criterion  Functions 

Annual  Markov  Chain  Model 

l3  =  1.139,  7  =  13.7,  ^  =  0.  Moment  Conditions  M4. 
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Figure  6.7 

Criterion  Functions 

Monthly  Markov  Chain  Model 

l3  =  .97^/>2   7  =  1.3,  ^  =  0.  Moment  Conditions  M2. 
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Figure  6.8 

Criterion  Functions 

Monthly  Markov  Chain  Model 

/?  =  .97^/^2^  7  =  1.3,  ^  =  0.  Moment  Conditioas  M3. 
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Figure  6.9 

Criterion  Functions 

Annual  Markov  Chain  Model,  Continuous- Updating  Estimator 

13  =  .97,  7  =  1.3,  Time  Nonseparable.  Moment  Conditions  M3. 

e  =  1/3 ^  =  0, _ ^  =  -1/3,  ■•■  e  =  -2/3 
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Figure  6.10 

Criterion  Functions 

Monthly  Markov  Chain  Model,  Continuous- Updating  Estimator 

l3  =  .97^/*^,  7  =  1.3,  Time  Nonseparable.  Moment  Conditions  M2. 

......e  =  1/3 e  =  o,_0  ^  -1/3 
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Figure  6.11 

Criterion  Functions 

Monthly  Markov  Chain  Model,  Continuous-Updating  Estimator 

/J  =  .97*/*^,  7  =  1.3,  Time  Nonseparable.  Moment  Conditions  M3. 

e=  1/3 ^  =  0,  _^  =  -1/3 
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